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A METHOD FOR IMPLEMENTING FAST TYPE CHECKING 



FIELD OF THE INVENTION 

[0001] The present invention relates generally to the data structures of object-oriented 
languages, and more specifically to methods and apparatuses to provide faster data 
structure type checking. 

BACKGROUND OF THE INVENTION 

[0002] Object-oriented computer programming languages such as JAVA and C# 
typically employ type hierarchies (concept 'type' and 'class' can be used interchangeably 
in this document). In computer languages, types are used to describe a given entity. For 
example, a computer language may employ a data structure having various fields used to 
describe the subject type. The information contained within the fields uniquely defines 
the type. Normally, types have hierarchy, i.e., one type can be a subtype or supertype of 
another, e.g., type "apple" is a subtype of type "fruit", and is a supertype of type "red 
apple". Once the type is determined for some data in computer memory, it may be 
necessary to determine whether that type is a subtype of another type. The type hierarchy 
may be viewed as a type tree having a root type base with subtypes of the root type, and 
subtypes of the subtype, etc. At run time, these languages determine if one type is a 
subtype of another. Type checking is a very common operation in object-oriented 
languages. This checking is accomplished through use of instructions at runtime, e.g., in 
virtual machines. 

[0003] Figure 1 illustrates a type checking process in accordance with the prior art. 
The system 100, shown in Figure 1, includes data structures 105 through 109. Typically, 
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data structures, that are stored in computer memory, contain among other data, a type 
field and a pointer to a supertype field. To determine the type of data structure 105, the 
type field 105A is checked and data structure 105 is determined to be of type E. It may 
also be necessary to determine if type E is a subtype of another type, for example it may 
be necessary to determine if type E is a subtype of type B. This is accomplished by 
checking the supertype pointer field 105b of data structure 105. Supertype pointer field 
105b provides a pointer to the supertype of type E. The pointer is dereferenced to obtain 
type E's supertype (i.e. type D located at typefield 106a of data structure 106). Likewise, 
the supertype of type D is determined by obtaining a pointer to D's supertype and 
dereferencing the pointer. The process is continued until it is determined that type B is 
supertype of type E (or conversely that type E is a subtype of type B). In general, this 
process is done recursively until it is determined that a given type (e.g., type E) is a 
subtype of another type (e.g., type B) or until a root type is reached. In system 100, type 
A is a root type, that is type A is not a subtype of any other type. This is indicated by the 
fact that supertype pointer field 109B is null. 

[0004] In system 100, each time a supertype pointer is obtained and dereferenced the 
process requires memory access. Such recursive memory access taxes processing 
resources and is time consuming. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0005] The present invention is illustrated by way of example, and not limitation, by the 
figures of the accompanying drawings in which like references indicate similar elements 
and in which: 

[0006] Figure 1 illustrates a type checking process in accordance with the prior art; 
[0007] Figure 2 is a diagram illustrating an exemplary computing system 200 for 
implementing a fast type checking method in accordance with an embodiment of the 
present invention; 

[0008] Figure 3 illustrates a type hierarchy tree and a corresponding data structure in 
accordance with one embodiment of the present invention; and 

[0009] Figure 4 is a process flow diagram in accordance with one embodiment of the 
present invention. 
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DETAILED DESCRIPTION 

[0010] The present invention provides methods and apparatuses for allowing faster data 
structure type checking. In one embodiment, the prior art data structure that includes a 
type field, and a field for storing a pointer to the supertype of the data structure type is 
replaced by a data structure that includes a sub-root log to store successive supertypes 
(type hierarchy references) of the data structure type hierarchy. Alternatively, the type 
data structure may not contain the sub-root log, but may contain a pointer to the sub-root 
log. Currently, typical applications have at most seven type-hierarchy references with a 
majority having no more than three. This means that by implementing a sub-root log 
storing six hierarchy references within the data structure, virtually all, typical, 
applications can be type checked without recourse to the recursive prior art method. In 
one embodiment, three fields are used to store the three successive references to a given 
type's supertype hierarchy. In an alternative embodiment, all references to a given type's 
supertype hierarchy may be stored in a type data structure. In another alternative 
embodiment, the number of supertype references used may be dynamically determined at 
run time for a given application. 

[0011] Figure 2 is a diagram illustrating an exemplary computing system 200 for 
implementing a fast type checking method in accordance with an embodiment of the 
present invention. A data structure containing multiple, successive, type hierarchy 
elements can be implemented and utilized within computing system 200, which can 
represent a general-purpose computer, portable computer, or other like device. The 
components of computing system 200 are exemplary in which one or more components 
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can be omitted or added. For example, one or more memory devices can be utilized for 
computing system 200. 

[0012] Referring to Figure. 2, computing system 200 includes a central processing unit 
202 and a signal processor 203 coupled to a display circuit 205, main memory 204, static 
memory 206, and mass storage device 207 via bus 201. Computing system 200 can also 
be coupled to a display 221, keypad input 222, cursor control 223, hard copy device 224, 
input/output (I/O) devices 225, and audio/speech device 226 via bus 20 L 
[0013] Bus 201 is a standard system bus for communicating information and signals. 
CPU 202 and signal processor 203 are processing units for computing system 200. CPU 
202 or signal processor 203 or both can be used to process information and/or signals for 
computing system 200. CPU 202 includes a control unit 231, an arithmetic logic unit 
(ALU) 232, and several registers 233, which are used to process information and signals. 
Signal processor 203 can also include similar components as CPU 202. 
[0014] Main memory 204 can be, e.g., a random access memory (RAM) or some other 
dynamic storage device, for storing information or instructions (program code), which are 
used by CPU 202 or signal processor 203. Main memory 204 may store temporary 
variables or other intermediate information during execution of instructions by CPU 202 
or signal processor 203. Static memory 206, can be, e.g., a read only memory (ROM) 
and/or other static storage devices, for storing information or instructions, which can also 
be used by CPU 202 or signal processor 203. Mass storage device 207 can be, e.g., a 
hard or floppy disk drive or optical disk drive, for storing information or instructions for 
computing system 200. 
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[0015] Display 221 can be, e.g., a cathode ray tube (CRT) or liquid crystal display 
(LCD). Display device 221 displays information or graphics to a user. Computing 
system 200 can interface with display 221 via display circuit 205. Keypad input 222 is 
an alphanumeric input device with an analog to digital converter. Cursor control 223 can 
be, e.g., a mouse, a trackball, or cursor direction keys, for controlling movement of an 
object on display 221. Hard copy device 224 can be, e.g., a laser printer, for printing 
information on paper, film, or some other like medium. A number of input/output 
devices 225 can be coupled to computing system 200. Data structures containing 
multiple type hierarchy references, in accordance with the present invention, can be 
implemented by hardware and/or software contained within computing system 200. For 
example, CPU 202 or signal processor 203 can execute code or instructions stored in a 
machine-readable medium, e.g., main memory 204. 

[0016] The machine-readable medium may include a mechanism that provides (i.e., 
stores and/or transmits) information in a form readable by a machine such as computer or 
digital processing device. For example, a machine-readable medium may include a read 
only memory (ROM), random access memory (RAM), magnetic disk storage media, 
optical storage media, flash memory devices. The code or instructions may be 
represented by carrier-wave signals, infrared signals, digital signals, and by other like 
signals. 

[0017] Figure 3 illustrates a type hierarchy tree and a corresponding type data structure 
in accordance with one embodiment of the present invention. System 300, shown in 
Figure 3, contains a type hierarchy tree 310 in which type A is a root type. The type 
hierarchy tree 310 may also illustrate a portion of a larger type hierarchy tree. As 
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illustrated by the type hierarchy tree, type A is a supertype of type B, type B is a 
supertype of type C, type C is a supertype of type D, and type D is a supertype of type E. 
As shown, type A is also a supertype of type S. In languages such as JAVA and C#, a 
given type may have more than one subclass, but each type may have at most one 
supertype. That is, multiple inheritances are not allowed, a type may inherit from at most 
one supertype. 

[0018] In accordance with an embodiment of the present invention, at run time, objects 
that are of type E are represented with data structure 320 that contains multiple references 
to the type hierarchy tree. A small piece of memory is used to cache the references. In 
one embodiment, the cache may contain some subset of the entire type hierarchy tree, for 
example, three references. Alternatively, all of the references to type E's supertypes may 
be cached. 

[0019] Therefore, the present invention allows, in one embodiment the quick 
determination of type hierarchy. For example, if it is necessary to determine if type B is 
a supertype of type E, it is only necessary to examine the data structure of E which 
contains three supertype reference levels including type B, If the supertype level to be 
checked is greater than that contained within the type data structure, then a recursive 
process may be used. For example, if it is necessary to determine whether type A is a 
supertype of type E, the data structure of type E is examined. Type B is determined to be 
the highest referenced supertype of type E. The data structure of type B is then examined 
and type A is determined to be supertype of type B, and hence type A is determined to be 
a supertype of type E as well. 
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[0020] Empirically it is found that a data structure containing three type hierarchy 
reference levels is sufficient to allow type checking without resort to a recursive process 
for a majority of applications. The type hierarchy tree for most typical applications 
contains no more than seven levels. A data structure containing a root tree log with six 
references, therefore, may suffice to provide fast type checking without referring to the 
data structure of an intermediate supertype. 

[0021] It will be appreciated that the method of the present invention contemplates any 
number of root tree log references, with the number implemented dependent on the 
specific application and such practical concerns as memory resources versus processing 
resources. 

[0022] Figure 4 is a process flow diagram in accordance with one embodiment of the 
present invention. The process 400 may be used to determine if object X is of type Y; 
typically, is type Y (a query type) a supertype of the type of which object X is a member 
(the object type). The process 400, shown in Figure 4, begins at operation 405 in which 
an evaluation is made to determine if the type of object X is equal to type Y (that is, is 
object X of type Y). If so, the process returns true at operation 410. If not, the process 
continues at operation 415 in which the depth of type X and type Y within the type 
hierarchy tree is compared. An index value equal to the depth value of type X minus the 
depth value of type Y is computed. If the index value is less than or equal to zero (i.e., 
type Y is deeper than type X), this indicates that type Y is impossible to be a supertype of 
type X and the process returns false at operation 420. If the index value is more than zero 
then the index value is compared to the number of type hierarchy references within the 
sub-root log at operation 425. If the index is equal to or smaller than the number of type 
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hierarchy references then the corresponding type cached in the data structure is obtained 
and compared to type Y at operation 430, The process returns true if the two types are 
equal. 

[0023] If the index is larger than the number of type hierarchy references in the sub-root 
log, then type Y is not cached within the data structure. The highest referenced type is 
obtained at operation 435 and repeat the process recursively from operation 415. 
[0024] An exemplary pseudo-code implementation for a JAVA language instruction to 
determine an object's type is included as Appendix A. The pseudocode of Appendix A 
begins by reverting to the recursive method of the prior art for the more complex cases 
where the query type (classT) is an array or an interface type. An array type is a type that 
comprises multiple components of another type and an interface type is a type without 
real features. In one embodiment, the recursive method of the prior art can be employed 
to handle these more complex types. 

[0025] In the foregoing specification, the invention has been described with reference to 
specific exemplary embodiments thereof. It will, however, be evident that various 
modifications and changes may be made thereto without departing from the broader spirit 
and scope of the invention as set forth in the appended claims. The specification and 
drawings are, accordingly, to be regarded in an illustrative sense rather than a restrictive 
sense. 
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APPENDIX A 

The exemplary pseudo-code below is an implementation for a JAVA language 
instruction to determine an object's type. The source object is ObjectX and the query 
type for checking is Class Y. 

if (! (Class Y is array | | Class Y is interface) ) { 
/ / get the depth in type hierarchy 
DepthY = getClassDepth ( ClassY ) ; 
/ / get type of instance ObjectX, jitted instructions start here 
ClassX = getClassType ( ObjectX ) ; 
if ( ClassX = = ClassY ) / / fastest path for common case 
return TRUE; 

retry: 

DepthX = getClassDepth ( ClassX ) ; 

/ / get the slot index in superclasses cache array, 

/ / here we use three slots for type hierarchy cache, 

/ / slot #0 for its father type, #1 for father's father, 

/ / #2 for father's father's father 

index = DepthX - DepthY; 

if (index < = 0) 

return FALSE; 
//SLOT_NUMBER== = 3 

if (index > SLOT_NUMBER) { // not cached here 
/ / recursively get father's father's father type 
ClassX = getSlot (SLOT_NUMBER -1); 
goto retry; 

} 

/ / get the cached type for real comparison 
getSlot (index - 1) = = ClassY; 
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